Solving Classification Problems for Large Sets of Protein Sequences with the Example of Hox and ParaHox Proteins
نویسندگان
چکیده
Phylogenetic methods are key to providing models for how a given protein family evolved. However, these methods run into difficulties when sequence divergence is either too low or too high. Here, we provide a case study of Hox and ParaHox proteins so that additional insights can be gained using a new computational approach to help solve old classification problems. For two (Gsx and Cdx) out of three ParaHox proteins the assignments differ between the currently most established view and four alternative scenarios. We use a non-phylogenetic, pairwise-sequence-similarity-based method to assess which of the previous predictions, if any, are best supported by the sequence-similarity relationships between Hox and ParaHox proteins. The overall sequence-similarities show Gsx to be most similar to Hox2-3, and Cdx to be most similar to Hox4-8. The results indicate that a purely pairwise-sequence-similarity-based approach can provide additional information not only when phylogenetic inference methods have insufficient information to provide reliable classifications (as was shown previously for central Hox proteins), but also when the sequence variation is so high that the resulting phylogenetic reconstructions are likely plagued by long-branch-attraction artifacts.
منابع مشابه
GENERATING FUZZY RULES FOR PROTEIN CLASSIFICATION
This paper considers the generation of some interpretable fuzzy rules for assigning an amino acid sequence into the appropriate protein superfamily. Since the main objective of this classifier is the interpretability of rules, we have used the distribution of amino acids in the sequences of proteins as features. These features are the occurrence probabilities of six exchange groups in the seque...
متن کاملIs the evolution of Cnox-2 Hox/ParaHox genes "multicolored" and "polygenealogical?".
Understanding the evolution of metazoan bauplans is linked to understanding the evolution of Hox and ParaHox genes. At the base of metazoan radiation we see in both cases a quite confusing picture yet. Here Cnox-2 is one of the best studied diploblast Hox genes. Homologs of this gene are known from Placozoa and several Cnidaria. In those cases where full length gene sequences, or at least full ...
متن کاملA New Balancing and Ranking Method based on Hesitant Fuzzy Sets for Solving Decision-making Problems under Uncertainty
The purpose of this paper is to extend a new balancing and ranking method to handle uncertainty for a multiple attribute analysis under a hesitant fuzzy environment. The presented hesitant fuzzy balancing and ranking (HF-BR) method does not require attributes’ weights through the process of multiple attribute decision making (MADM) under hesitant conditions. For the rating of possible alternati...
متن کاملHox and ParaHox genes in Nemertodermatida, a basal bilaterian clade.
Molecular evidence suggests that Acoelomorpha, a proposed phylum composed of acoel and Nemertodermatida flatworms, are the most basal bilaterian animals. Hox and ParaHox gene complements characterised so far in acoels consist of a small set of genes, comprising representatives of anterior, central and posterior genes, altogether Hox and ParaHox, but no PG3-Xlox representatives have been reporte...
متن کاملA New Compromise Decision-making Model based on TOPSIS and VIKOR for Solving Multi-objective Large-scale Programming Problems with a Block Angular Structure under Uncertainty
This paper proposes a compromise model, based on a new method, to solve the multi-objective large-scale linear programming (MOLSLP) problems with block angular structure involving fuzzy parameters. The problem involves fuzzy parameters in the objective functions and constraints. In this compromise programming method, two concepts are considered simultaneously. First of them is that the optimal ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 4 شماره
صفحات -
تاریخ انتشار 2016